Search CORE

27 research outputs found

Learning Systems : Stochastic Automata Models

Author: Thathachar M.A.L.
Publication venue: 'Defence Scientific Information and Documentation Centre'
Publication date: 24/01/2014
Field of study

Stochastic automata operating in an unknown random environment have been successfully used in modelling learning systems. For such learning automata a new algorithm which estimates the environmental characteristics is presented and is used in the solution of a pattern classification problem with unknown class-conditional densities. The algorithm selects the optimal threshold asymptotically with probability one

Defence Science Journal

Modeling a teacher in a tutorial-like system using Learning Automata

Author: A. Baddeley
A. Vasilakos
A.F. Atlassis
A.S. Poznyak
A.V. Vasilakos
B.J. Oommen
B.J. Oommen
B.J. Oommen
B.J. Oommen
B.J. Oommen
C. Bettstetter
C. Unsal
D.J. Bertsimas
E. Fischetti
F. Seredynski
J. Self
J.J. Collins
J.P. Cusumano
K. Najim
K.S. Narendra
M. Agache
M. Barzohar
M. Paola
M.A.L. Thathachar
M.A.L. Thathachar
M.L. Brandeau
M.L. Tsetlin
M.R. Meybodi
M.S. Obaidat
M.S. Obaidat
R.L. Cook
S. Misra
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2012
Field of study

The goal of this paper is to present a novel approach to model the behavior of a Teacher in a Tutorial- like system. In this model, the Teacher is capable of presenting teaching material from a Socratic-type Domain model via multiple-choice questions. Since this knowledge is stored in the Domain model in chapters with different levels of complexity, the Teacher is able to present learning material of varying degrees of difficulty to the Students. In our model, we propose that the Teacher will be able to assist the Students to learn the more difficult material. In order to achieve this, he provides them with hints that are relative to the difficulty of the learning material presented. This enables the Students to cope with the process of handling more complex knowledge, and to be able to learn it appropriately. To our knowledge, the findings of this study are novel to the field of intelligent adaptation using Learning Automata (LA). The novelty lies in the fact that the learning system has a strategy by which it can deal with increasingly more complex/difficult Environments (or domains from which the learning as to be achieved). In our approach, the convergence of the Student models (represented by LA) is driven not only by the response of the Environment (Teacher), but also by the hints that are provided by the latter. Our proposed Teacher model has been tested against different benchmark Environments, and the results of these simulations have demonstrated the salient aspects of our model. The main conclusion is that Normal and Below-Normal learners benefited significantly from the hints provided by the Teacher, while the benefits to (brilliant) Fast learners were marginal. This seems to be in-line with our subjective understanding of the behavior of real-life Students

Crossref

Carleton University's Institutional Repository

NORA - Norwegian Open Research Archives

Agder University Research Archive

On the L2-stability of a class of nonlinear systems

Author: Sundareshan M.K
Thathachar M.A.L
Publication venue: Published by Elsevier Inc.
Publication date: 30/06/1973
Field of study

AbstractSufficient conditions are given for the L2-stability of a class of feedback systems consisting of a linear operator G and a nonlinear gain function, either odd monotone or restricted by a power-law, in cascade, in a negative feedback loop. The criterion takes the form of a frequency-domain inequality, Re[1 + Z(jω)] G(jω) ⩾ δ > 0 ∀ω ϵ (−∞, +∞), where Z(jω) is given by, Z(jω) = β[Y1(jω) + Y2(jω)] + (1 − β)[Y3(jω) − Y3(−jω)], with 0 ⩽ β ⩽ 1 and the functions y1(·), y2(·) and y3(·) satisfying the time-domain inequalities, ∝−∞+∞¦y1(t) + y2(t)¦ dt ⩽ 1 − ϵ, y1(·) = 0, t < 0, y2(·) = 0, t > 0 and ϵ > 0, and ∝0∞¦y3(t)¦ dt < 12c2, c2 being a constant depending on the order of the power-law restricting the nonlinear function. The criterion is derived using Zames' passive operator theory and is shown to be more general than the existing criteria

Elsevier - Publisher Connector

Open Access Repository of IISc Research Publications

Empirical Verification of a Strategy for Unbounded Resolution in Finite Player Goore Games

Author: B. Tung
K.S. Narendra
M.A.L. Thathachar
M.A.L.T. Thathachar
M.L. Tsetlin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

Crossref

A New Class of ε-Optimal Learning Automata

Author: A.A. Econdomides
A.A. Economides
B.J. Oommen
G.I. Papadimitriou
M.A.L. Thathachar
M.A.L. Thathachar
M.L. Tsetlin
R.S. Shutton
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Crossref

A two-armed bandit collective for examplar based mining of frequent itemsets with applications to intrusion detection

Author: H. Wang
M.A.L. Thathachar
M.L. Tsetlin
M.V. Mahoney
R. Lippmann
R.S. Sutton
Publication venue: Springer Berlin / Heidelberg
Publication date: 01/01/2011
Field of study

Over the last decades, frequent itemset mining has become a major area of research, with applications including indexing and similarity search, as well as mining of data streams, web, and software bugs. Although several efficient techniques for generating frequent itemsets with a minimum support (frequency) have been proposed, the number of itemsets produced is in many cases too large for effective usage in real-life applications. Indeed, the problem of deriving frequent itemsets that are both compact and of high quality, remains to a large degree open. In this paper we address the above problem by posing frequent itemset mining as a collection of interrelated two-armed bandit problems. In brief, we seek to find itemsets that frequently appear as subsets in a stream of itemsets, with the frequency being constrained to support granularity requirements. Starting from a randomly or manually selected examplar itemset, a collective of Tsetlin automata based two-armed bandit players aims to learn which items should be included in the frequent itemset. A novel reinforcement scheme allows the bandit players to learn this in a decentralized and on-line manner by observing one itemset at a time. Since each bandit player learns simply by updating the state of a finite automaton, and since the reinforcement feedback is calculated purely from the present itemset and the corresponding decisions of the bandit players, the resulting memory footprint is minimal. Furthermore, computational complexity grows merely linearly with the cardinality of the examplar itemset. The proposed scheme is extensively evaluated using both artificial data as well as data from a real-world network intrusion detection application. The results are conclusive, demonstrating an excellent ability to find frequent itemsets at various level of support. Furthermore, the sets of frequent itemsets produced for network instrusion detection are compact, yet accurately describe the different types of network traffic present

Crossref

NORA - Norwegian Open Research Archives

Agder University Research Archive

Learning Automata Based Cooperative Student-Team in Tutorial-Like System

Author: B.J. Oommen
B.J. Oommen
G. Horn
G.I. Papadimitriou
M. Agache
M.A.L. Thathachar
M.A.L. Thathachar
S. Misra
W. Yuan
X. Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref

On the L2-stability of a class of nonlinear systems

Author: Cho
Dunford
M.A.L Thathachar
M.K Sundareshan
O'Shea
Popov
Sundareshan
Thathachar
Zames
Zames
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Multiaction learning automata possessing ergodicity of the mean

Author: Oommen J. (B. John)
Thathachar M.A.L. (M. A.L.)
Publication venue: 'Elsevier BV'
Publication date: 01/01/1985
Field of study

Multiaction learning automata which update their action probabilities on the basis of the responses they get from an environment are considered in this paper. The automata update the probabilities according to whether the environment responds with a reward or a penalty. Learning automata are said to possess ergodicity of the mean if the mean action probability is the state probability (or unconditional probability) of an ergodic Markov chain. In an earlier paper [11] we considered the problem of a two-action learning automaton being ergodic in the mean (EM). The family of such automata was characterized completely by proving the necessary and sufficient conditions for automata to be EM. In this paper, we generalize the results of [11] and obtain necessary and sufficient conditions for the multiaction learning automaton to be EM. These conditions involve two families of probability updating functions. It is shown that for the automaton to be EM the two families must be linearly dependent. The vector defining the linear dependence is the only vector parameter which controls the rate of convergence of the automaton. Further, the technique for reducing the variance of the limiting distribution is discussed. Just as in the two-action case, it is shown that the set of absolutely expedient schemes and the set of schemes which possess ergodicity of the mean are mutually disjoint

Carleton University's Institutional Repository

TWO ACTION LEARNING AUTOMATA POSSESSING ERGODICITY OF THE MEAN.

Author: Oommen J. (B. John)
Thathachar M.A.L. (M. A L)
Publication venue
Publication date: 01/12/1983
Field of study

Carleton University's Institutional Repository